A Parallel Implementation of the Nonsymmetric QR Algorithm for Distributed Memory Architectures

نویسندگان

  • Greg Henry
  • David S. Watkins
  • Jack J. Dongarra
چکیده

One approach to solving the nonsymmetric eigenvalue problem in parallel is to parallelize the QR algorithm. Not long ago, this was widely considered to be a hopeless task. Recent e orts have made signi cant advances, although the methods proposed up to now have su ered from scalability problems. This paper discusses an approach to parallelizing the QR algorithm that greatly improves scalability. A theoretical analysis indicates that the algorithm is ultimately not scalable, but the nonscalability does not become evident until the matrix dimension is enormous. Experiments on the Intel ParagonTM system, the IBM SP2 supercomputer, and the Intel ASCI Option Red Supercomputer are reported.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Distributed Memory Implementation of the Nonsymmetric QR Algorithm

The QR algorithm is the crux of the serial nonsymmetric eigenvalue problem. Recent eeorts to parallelize this algorithm have made signiicant advances towards solving the parallel nonsym-metric eigenvalue problem. Most methods to date suuer a scalability problem. In this talk we discuss an approach for parallelizing QR which overcomes many of the disadvantages to date. We also give insights into...

متن کامل

A Parallel Implementation of the Nonsymmetric Qr Algorithm for Distributed Memory

One approach to solving the nonsymmetric eigenvalue problem in parallel is to parallelize the QR algorithm. Not long ago, this was widely considered to be a hopeless task. Recent efforts have led to significant advances, although the methods proposed up to now have suffered from scalability problems. This paper discusses an approach to parallelizing the QR algorithm that greatly improves scalab...

متن کامل

Polynomial Acceleration for Restarted Arnoldi Iteration and its Parallelization

We propose an accelerating method for the restarted Arnoldi iteration to compute a number of eigenvalues of the standard eigenproblem Ax = x and discuss the dependence of the convergence rate of the accelerated iteration on the distribution of spectrum. The e ectiveness of the approach is proved by numerical results. We also propose a new parallelization technique for the nonsymmetric double sh...

متن کامل

LAPACK Working Note # 216 : A novel parallel QR algorithm for hybrid distributed memory HPC systems ∗

A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on hybrid distributed high performance computing (HPC) systems is presented. For this purpose, we introduce the concept of multi-window bulge chain chasing and parallelize aggressive early deflation. The multi-window approach ensures that most computations when chasing chains of bulges are performed ...

متن کامل

A novel parallel QR algorithm for hybrid distributed memory HPC systems

A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on hybrid distributed high performance computing (HPC) systems is presented. For this purpose, we introduce the concept of multi-window bulge chain chasing and parallelize aggressive early deflation. The multi-window approach ensures that most computations when chasing chains of bulges are performed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • SIAM J. Scientific Computing

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2002